Statistical mechanics of RNA folding: a lattice approach 
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We propose a lattice model for RNA based on a self-interacting two-tolerant trail. Self-avoidance 
and elements of tertiary structure are taken into account. We investigate a simple version of the 
model in which the native state of RNA consists of just one hairpin. Using exact arguments and 
Monte Carlo simulations we determine the phase diagram for this case. We show that the denat- 
uration transition is first order and can either occur directly or through an intermediate molten 
phase. 
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I. INTRODUCTION 

In recent years, the diversity of roles played by RNA 
in cellular processes has become increasingly clear 0. 
RNA is a heteropolymer which is build from four types 
of monomers (nucleotides) and as for proteins, a major 
issue is the prediction of the tertiary (folded) structure 
for a given sequence of nucleotides 3J. In some respects, 
this question is easier to answer for RNA than for pro- 
teins. Firstly, the four nucleotides are chemically more 
similar than the twenty amino acids that form proteins. 
Secondly, it has been argued that folding in RNA is 
hierarchical in the sense that the energy scales involved 
in secondary structure elements are larger than those of 
tertiary structure. Therefore much attention, also in the 
physics literature HllHlllEllEIiil, has been de- 
voted to predicting the secondary structure of RNA. 

In most of the existing models, self- avoidance has not 
been fully taken into account. If one also neglects certain 
type of monomer-monomer interactions such as those oc- 
curing in pseudoknots and kissing hairpins , it becomes 
possible to calculate secondary structures with a recur- 
sive algorithm whose complexity only grows as the third 
power of the number of monomers L 5] . Despite its suc- 
cess, this approach has several drawbacks. Firstly, the 
neglect of self-avoidance leads to unphysical properties 
for the radius of gyration 0, especially at low temper- 
atures. Secondly, the neglect of part of the physically 
relevant interactions is often justified a posteriori from 
the observation that (for example) pseudoknots are not 
very common in real RNA. However, it would be more 
attractive to have a model which allows to predict the 
rate of occurence of these structural elements. A first 
attempt alon g th ese lines, but neglecting self-avoidance, 
was made in jl2j . 

In this paper, we propose a lattice model for RNA 
which can take into account both self-avoidance and most 
of the physically relevant interactions. To get a first in- 
sight into the properties of the model, we study here in 
detail a simplified version, which is a lattice variant of 
the model studied in By using rigorous and numeri- 
cal techniques well-known from the study of other lattice 
models of polymers Ts], we find that RNA can exist in 



three phases. In the present work we study the properties 
of these phases and of the transitions between them. In 
a forthcoming publication, we will investigate questions 
such as the probability of formation of pseudoknots. 

This paper is organised as follows. In section II we 
present our lattice model and show how one can include 
several aspects of real RNA in a natural way. We also 
introduce the simplified version of the model. In section 
III we present a number of semi-exact results from which 
the phase diagram for this case can be obtained. In sec- 
tion IV we give the results of an extensive Monte Carlo 
study of this phase diagram in two dimensions. Finally, 
in section V, we present our conclusions. 



II. AN INTERACTING TWO-TOLERANT TRAIL 
MODEL OF RNA 

RNA is a heteropolymer whose primary structure con- 
sists of a sequence of ribonucleotide bases. Of these there 
are four types, which are denoted by the symbols C, G, 
U and A. The particular sequence of these bases deter- 
mines the three dimensional structure through base-base 
interactions, of which pairing and stacking are the most 
important. 

We make a lattice model for RNA starting from a 
two-tolerant trail This is a lattice random walk 

that can visit each edge of the lattice at most twice. 
The walk has L steps, each of which corresponds to a 
monomer of RNA. Doubly visited edges correspond to 
bonded base pairs. The restriction to allow only doubly 
visited edges thus takes into account the saturation of 
the base-base interactions. In Fig. QJwe show a typical 
two-tolerant trail of 500 steps. When step i and j are on 
the same edge, we say that they are bonded. The two- 
tolerant trail consists of sets of connected bonded steps 
(corresponding to helices in real RNA) alternating with 
sets of singly visited edges (corresponding to loops and 
bulges) With each L-step two-tolerant trail T we 
associate a set Sr whose elements are the bonded steps 
of the trail: Sr — {{hj) ■ hj are bonded}. In most 
of the existing theoretical work on RNA the following 
restriction is put on the bonded steps: when both (i,j) 
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FIG. 1: A two-tolerant trail of 500 steps. Doubly visited edges 
are represented by double lines. 

and («', j') are bonded, one only allows the combinations 
i < j < i' < j' and i < i' < j' < j . The situation in 
which i < i' < j < j' will be referred to as a pseudo- 
knot, even though in the biological literature this class 
of structures is further divided into several types |3| . In 
our model, pseudoknots are treated on the same footing 
as the other types of bonds. 

Next, we associate to each doubly visited edge a pairing 
energy etj which depends on the nature of the bases i 
and j present on the edge. Below we will propose a simple 
form for Sij inspired by the work of and investigate 
the phase diagram of the resulting model. 

Before doing that we comment here on how other phys- 
ically relevant interactions can be introduced in a natural 
way in our model. The most important of these is the 
stacking energy. This is an interaction between neigbour- 
ing bonded base pairs and and it can be modelled by as- 
sociating an energy (Ti^i+i with each element of St- 
The value of this stacking energy depends on the four nu- 
cleotides forming the stack. When the pair {i + l,j — 1) 
is not bonded we put cr^^i+i — 0. 

Because of the semiflexible nature of nucleid acids, it 
is also necessary to add a bending rigidity for which we 
assume the form K{ni — rii+i)'^ with k > 0. Here Hi 
is the unit vector in the direction of the i-th step. The 
total energy associated with the two-tolerant trail T thus 
becomes 

Et = K^ifii - fti+i)"^ + ^ (£ij -I- cr,;^i+ij_ij) (1) 

The thermodynamic properties of the RNA-model can 
then be determined from the partition sum 

Zi=^exp(-/3£;r) (2) 

T 

where the sum is over all L-step two-tolerant trails and 
/3 is the inverse temperature /3 = l/fc^T. Moreover, if 



^ is a particular set of bonded monomer pairs we can 
determine the probability pa.l that this set occurs by 
calculating the ratio 



where Z^ l is given by an expression similar to jSJ with 
the sum restricted to trails T for which A C St- In this 
way, it is possible to calculate, for example, the probabil- 
ity that a pseudoknot appears as a function of tempera- 
ture. 

Within this model we can calculate the thermodynamic 
properties of an RNA molecule with a given sequence of 
bases by taking from the literature the estimated values 
of the relevant pairing and stacking energies. Clearly, 
such a calculation can only be performed numerically. 
Here, we follow another route with the main purpose of 
gaining a more detailed insight into a simplified version of 
our model. Such an approach can give useful information 
against which the results of a full numerical calculation 
can be understood. In the simplified model we neglect 
the stacking energies and the bending rigidity. More- 
over, we modify the nucleotide dependence of the pairing 
energies. A common approach in the physics literature 
is to take it as random |^ Hfl ITlj . In this way, the 
study of RNA can be linked to that of random systems 
such as spin glasses. However, clear differences between 
random and real RNA have been pointed out 15]. The 
latter has an evolutionary evolved, correlated sequence 
of bases, which is such that most often the ground state 
secondary structure is less degenerated than that of ran- 
dom RNA. An expression for the pairing energy which 
has this feature and which lends itself to detailed analy- 
sis was introduced in 6]. Following that work, we take 

£ij = £o + £^t+j,L+i (4) 

(where S is the Kronecker-delta and from now on we take 
L even). In the physical region of interest Eq < and 
e < 0. The second term in Q favors the formation of 
just one hairpin. This structure thus corresponds to the 
native state of our model. With all these simplifications, 
becomes 

Et = Sol + eN (5) 

where / is the total number of bonded base pairs and N 
the total number of native interactions (i.e. those pairs 
i,j for which 6i+j,L+i — !)• Finally, we introduce q = 
exp (— /Jeq) and q — exp (—/3s), so that the partition sum 
(O becomes 

ZL{q,q) = Y.l'^'' (6) 
T 

In the rest of this paper we study the phase diagram of 
the model defined by 10. 
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III. THE PHASE DIAGRAM 

The two-tolerant trail was originally introduced as a 
simple model for the coil-globule transition of homopoly- 
mers The authors of that work studied the two- 

tolerants trails with attractive self-interactions, which 
corresponds to our model for q = I. A closer investi- 
gation revealed however that in the low tempera- 
ture phase, the universal properties of the trail are not 
those of a collapsed globular polymer, but coincide with 
those of branched polymers (BP). In the high tempera- 
ture phase it was found 17] that the universality class of 
two-tolerant trails is that of the self-avoiding walk (SAW) 
[l^ . These results were based on exact enumerations in 
two dimensions and on a study of the model on fractal lat- 
tices. More recently, we investigated the non-interacting 
two-tolerant trail (i.e. q = q = 1) with a Monte Carlo 
method that will be discussed in the next section. We 
also found clear evidence that, at least in two dimensions, 
the non-interacting trail has the critical properties of the 
SAW. Let us therefore assume for the moment that along 
the line q — I, our model has a phase transition at some 
qc(l) > 1 between a SAW-phase and a branched polymer 
phase. A more quantitative investigation of this transi- 
tion will be presented in the next section. We now show 
how this assumption, together with a number of other, 
semi-exact results, leads to a qualitative determination 
of the form of the phase diagram. 

For this purpose, we introduce the free energy f{q,q) 
which is defined as 

f{q,q) ^ }im ylnZLiq^q) (7) 

Besides this free energy it is convenient to introduce the 
connective constant fJ-2{ci,Q) = exp(/(g,q)). The exis- 
tence of the limit in Q can only be proven rigorously for 
q < 1, <7 < 1 13- The proof is based on concatenation 
arguments and is a straightforward extension of that for 
the SAW JJij. On the basis of an exact enumeration we 
recently determined the estimate 

Ai2(l,l) = 3.486 ±.003 (8) 

on the square lattice. Moreover, it can also be proved 
that in the non-interacting case the connective constant 
for two-tolerant rings (i.e. two-tolerant trails whose last 
step ends at the starting point) equals that of all two- 
tolerant trails. Again, this result can be shown by exten- 
sion of the proof of a similar result for self-avoiding walks 
[l9j |. For more details, we refer to In the following, 
we will assume that (fTj) exists. 

Firstly, we investigate the behaviour of the free energy 
at fixed q and as a function of q. Consider therefore the 
limit q = where the only contribution to the free energy 
f{q, 0) comes from trails without native contacts. A spe- 
cial subset of these are the trails in which the first L/2 



steps and the last L/2 steps are in different halfspaces. 
It is known that for quite general type of walks, the free 
energy of walks limited to a halfspace is the same as that 
of unrestricted walks (i.e. f{q, 1) in our case) up to sur- 
face corrections These however vanish in the limit 
L — !■ oo. Thus, the free energy of the trails without native 
contacts is bounded from below by that of trails living in 
a half-space, which equals that of trails without spatial 
restrictions. Therefore, /(g, 1) < /(g,0), Vg. However, 
since the free energy is a non-decreasing function of q we 
must conclude 

/(g,q) = /(<Z,l)=ln/i2(g,l) 0<g<l (9) 

Secondly, it is possible to obtain a lower bound to the 
partition sum ^ by considering only the contribution 
from two-tolerant trails with a maximum number of na- 
tive contacts, i.e. those for which N — L/2. Each of 
these is a walk whose first L/2 steps is a one-tolerant 
trail (i.e. a walk that can visit each edge of the lattice at 
most once), and which then retraces these same steps in 
reverse order during its last L/2 steps. For this type of 
walk I ~ L/2 and therefore 

/(g,g)> i[ln^i+lng + lng] (10) 

Here /ii is the connective constant for non-interacting 
one-tolerant trails. For example, on the square lattice 
one has fn = 2.72058 ± .00020 [13 . 

Together © and H10|) imply the existence of a phase 
transition at some qc{q)- Moreover, one trivially arrives 
at the bounds 

< \nqc{q) < 2 In ^2(9, 1) - Ing- In/^i (11) 

It also follows that for each q fixed, the free energy equals 
the value given by ^ as long as g < qdq)- 

Further information on the upper bound in l|ll|) can 
be obtained from the following reasoning. For q = 1 and 
for q —^ 00, the partition sum is dominated by trails in 
which each edge is visited twice. Each such trail of L 
steps has the appearance of a weakly embedded bond 
lattice animal [13 whose L /2 bonds correspond with the 
doubly visited edges as we show for an example in Fig. |21 
But in the same figure we show that the mapping is not 
one-to-one. If we suppose that the number of trails that 
are mapped onto the same lattice animal is not extensive 
in L we can conclude that for q 3> 1 

ln^2(g, 1) ~ ^ [In^isp + Ing] (12) 

Here, fj,BP is the connective constant for weakly embed- 
ded bond lattice animals. Its value on the square lattice 
is estimated as ^bp = 5.21 ± .006 [l^. Hence, we can 
make the upper bound 111|) more precise for q — > cx). We 
obtain 

ln(jc('i') < In/iBp — In/ii q ^ co (13) 
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FIG. 2: Two two-tolerant trails with I = L/2 that can be 
mapped onto the same bond lattice animal. 



What is the physical nature of the phase transition 
whose existence we have just proven? To answer this 
question, consider the density of native contacts n((7, q). 



I~N 



L^oo L L^oo LZ[q,q) 



(14) 



The factor 2 ensures that n{q, q) = 1 when the two- 
tolerant trail consists of one single hairpin. In terms of 
the free energy this density equals 



n{q, q) 



,dfiq,q) 



(15) 



Thus, from our earlier results on the free energy we find 
that n((7, q) = for q < qdq) while this density becomes 
strictly positive above qdl)- We therefore interpret the 
phase transition at qdq) as a denaturation transition, i.e. 
a transition into (or out of) the native state. Also notice 
that this transition exist for every value of q. 

As discussed in the beginning of this section, numerical 
evidence shows that there is also a transition between a 
self-avoiding walk regime and a branched polymer (BP) 
one along the line g = 1, at some critical value (Zc(l)- 
Clearly, since the free energy does not depend on q for 
q < qdq), the SAW-BP transition has to be also present 
for some range of g-values, and moreover neither the lo- 
cation of the transition point nor its critical properties 
can depend on q. 

We therefore arrive at the phase diagram shown in Fig. 
13 There are three phases. At low q (or |£o|) and q (or 
|e|), RNA is denaturated and behaves as a coil in the uni- 
versality class of the self-avoiding walk. For q > qd^), 
and for q sufficiently small, there is a collapse into a 
branched polymer or 'molten' phase. Finally for q suf- 
ficiently large, we are in the native, hairpin phase. The 
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FIG. 3: Schematic phase diagram of our model. The thick 
full line is a line of second order transitions, whereas along 
the broken line the transition is first order. The thin full 
lines indicate the three paths along which we investigated the 
model numerically. 



arguments we have given are quite general and we there- 
fore believe that this phase diagram is correct, indepen- 
dently of dimension. While the denaturation transition 
is always present, it is possible that the SAW-BP tran- 
sition disappears above some critical dimension. As an 
example of this we mention that in the model of reference 
where pseudoknots and self-avoidance are neglected, 
and which thus can be seen as a mean-field model, no 
evidence for the coil phase is found, and only the tran- 
sition between the native and molten phase is present. 
It is interesting to remark that in that work it was also 
found that in the molten phase, RNA has the properties 
of (mean-field) branched polymers, as is the case here. 



IV. NUMERICAL RESULTS 

In order to get a more quantitative insight into the 
phase diagram, we have investigated our model in the 
{q, q')-plane with extensive Monte Carlo simulations. Fur- 
ther numerical insight was obtained from an exact enu- 
meration of our model for L < 18. In these numerical 
approaches, we worked on a square lattice. 

For the simulations, we used the pivot algoritm (23| , a 
well-known technique that generates a Markov chain in 
the set of all allowed walks of a certain type. The pivot al- 
gorithm was originally introduced for self-avoiding walks. 
In |2J| we investigate the extension of this method to non- 
interacting two-tolerant trails, discussing such aspects as 
ergodicity of the algorithm, acceptance ratio, autocorre- 
lation times, and so on. 
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Here we have to take into account interactions and do 
this by adding a standard Metropohs step to the algo- 
rithm. Moreover, we implement the program within a 
Multiple Markov Chain approach (MMC) H^. In this 
method several Markov chains at different temperatures 
are run in parallel and at regular intervals an attempt 
is made to switch two trails between Markov chains at 
different temperature. Such an attempt is accepted with 
a probability that is a trivial extension of that of the 
Metropolis algorithm. The MMC approach has the ad- 
vantage that it allows better sampling at lower temper- 
atures, where a standard algorithm may easily get stuck 
near a metastable state. Besides the pivot moves, we also 
found it useful to add some local moves to enhance the 
performance of the algorithm. Typically, at each tem- 
perature we performed 10^ Monte Carlo steps resulting 
in (.5 ~ 1.) X 10® independent configurations over which 
averages are calculated. 

With this approach, we investigated our model on 
the square lattice along three lines. For these, we chose 
q = 1, q — I and q = q^. We now discuss the results. 



The line g = 1. 

On the basis of the phase diagram obtained in the pre- 
vious section, we expect that along this line there is only 
the denaturation transition. Evidence for this transition 
can most easily be found by looking at the density of na- 
tive contacts. In Fig. ^we show our results for n{l,q) 
for two-tolerant trails of different L as obtained from the 
exact enumeration data. We clearly recognise the be- 
haviour predicted in section III, dressed with finite size 
roundings. From the intersections of the curves for differ- 
ent L- values a first estimate for the location of the critical 
point can be made. Moreover, since the transition into 
the native state shares some properties with the adsorp- 
tion transition of a polymer onto a surface, we expect 
that right at the critical point the density of contacts 
scales as n ~ L^"~^, where (fin is a crossover exponent. 
From the exact enumeration data shown in Fig. 0] we es- 
timate ipn ~ .88. However, a study of the same quantity 
with the Monte Carlo approach shows that this estimate 
is still strongly affected by finite size effects. For exam- 
ple, the value for (p„ tends to increase to a value close 
to 1. This suggests that the actual value of (fn is one, 
which would be the case for a first order transition. To 
verify this idea we made histograms for our data for n 
at different values. These are indeed consistent with a 
first order transition. As an example, we show in Fig. 
such an histogram at the transition point. There is clear 
evidence for two peaks, one near n = 0, the other around 
n « .65. In fact, it turns out that from these histograms, 
a rather sharp estimate for qc can be obtained, with the 
result qc = 4.20 ± .02. 




FIG. 4: The number of native contacts as a function of q 
along the line q = 1 for different L-values, as obtained from 
exact enumerations. 
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FIG. 5: Histogram for the density of native contacts at the 
denaturation transition for L = 300 (q = 1). 



The fact that the denaturation transition is first order 
can also be understood with hindsight from a compar- 
ison with known results from DNA-models. Indeed, 
in recent years there has been quite some interest in 
understanding the nature of the denaturation transition 
for that biopolymer. Almost all existing evidence now 
shows that this transition is first order, both in d = 2 and 
d = 3 Hi, li^, 113 ■ When within our model, we divide 
the two-tolerant trail into two halves, we can see them 
as the two strands of a DNA-molecule whose starting 
point is halfway on the two-tolerant trail. The native 
interactions, which are the only ones appearing along 
the line q — 1, can thus be interpreted as interactions 
between homologous bases on the two strands of the 
DNA. In this way, our model with q — 1 becomes in 
a sense the dual of a recently studied lattice model of 
DNA [2^ , and we can therefore expect both models 
to have a similar critical behaviour. 
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FIG. 6: Plot of Y{q, 1) (see text) for different L-values. 

The line q = 1. 

The phase transition between the coil and branched 
polymer regime along the line g = 1 is more difficult to 
analyse. There is no obvious order parameter character- 
ising this transition, since the average number of bonded 
base pairs, (/), is extensive on both sides of the transi- 
tion. 

We therefore investigated two other quantities. Firstly, 
we looked at the specific heat, which for q — 1 equals 



CUq) 



1 
I 



(16) 



While both the exact enumerations and the Monte Carlo 
simulations show that the specific heat has a peak that 
slowly grows with L, it is difficult to obtain reliable es- 
timates for the location of the critical point and for the 
crossover exponent from these data. 

Secondly, it is possible to get information on the 
SAW-BP transition from the ratio Y{q, 1) of the average 
squared end-to-end distance over the average squared ra- 
dius of gyration. It is well known that this is a universal 
quantity so we expect its behaviour to be stepwise as a 
function of q, at least for L oo. In fact, we expect that 
for large q where the two-tolerant trail visits each edge 
twice (see Fig. I^l, the end-to-end distance approaches 
zero and hence Y{q, 1) goes to zero for large L. On the 
other hand, we verified recently that for non-interacting 
two-tolerant trails Y{1, 1) = 7.1235 ± .001 (2J|, fully con- 
sistent with the value for the SAW. Hence along the line 
q — I, Y{q, 1) should assume this value below gc(l), and 
then drop to zero. In Fig. |3 we present our data for 
Y{q,l), which have the expected behaviour, dressed with 
finite size rounding. From these results we are able to 
obtain the most accurate estimate for gc(l) which equals 
2.91 ± .08. 

In Fig. [3 we show data for the squared radius of gy- 
ration Rq{L) as a function of L in the BP-phase. From 
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FIG. 7: Plot of R%iL) versus L in the BP-phase at q 
3.49, 5 = 1. 



this we can obtain the geometric exponent v since 



(17) 



From a fit of the exact enumeration and Monte Carlo 
data in the BP-phase we find ly « .55. This is still far 
from the best known value for two-dimensional branched 
polymers which is = .64075 ± .00015 jSg. This differ- 
ence is probably due to strong corrections to scaling. In- 
deed, this also happens for the non-interacting situation 
where very long trails can be simulated, up to L = 7500. 
From them we obtain — .749 ± .001, as should be ex- 
pected for a walk in the SAW universality class. However, 
this exponent is only recovered for L > 200. We expect 
that the (/-exponent for branched polymers will show up 
if one studies longer trails at temperatures sufficiently be- 
low the transition. But that regime is difficult to probe 
with our Monte Carlo technique. 



The line q = cf' . 

From the phase diagram shown in Fig. |3 from the 
available estimates for (7c(l), and from H13|l we expect 
that along this line two phase transitions will be encoun- 
tered. The first one is the SAW-BP transition, which 
can be analysed most suitably from an investigation of 
the ratio Y{q,q). The BP-native transition on the other 
hand, should show itself through a study of the density 
of native contacts. 

In Fig. IHlwe show our results for Y{q,^). As was 
the case before, the data for different L-values intersect, 
yielding the estimate qc — 3.00±.06. This result is within 
the numerical accuracy the same as that for g = 1 , as was 
predicted in section III. 

From a study of the density of native contacts, we con- 
clude that the transition between the branched polymer 
and the native phase is also first order. Since in this case 
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FIG. 8: Plot of Y{q, ^) for different L-values. 
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FIG. 9: Histogram for the number of native contacts at the 
BP- native transition along the line q = tf' . The data are for 
L = 200 and q = 2.17. 



it is a transition between two rather dense phases, it is 
more diflrcult to obtain a reliable estimate for q^. We 
find qc = 2.17 ± .15. Fig. |5| shows a histogram for the 
number of native contacts at this point. 



V. CONCLUSIONS 

In this paper, we have introduced a lattice model, 
based on a two-tolerant trail that seems well suited to 
investigate thermodynamic properties of RNA. We have 
found that for a simple version of the interaction ener- 
gies, there are three phases and we have investigated in 
detail the transitions between these phases with a Monte 
Carlo method. 

We believe that the structure of the phase diagram 
that we found here is not particular for the choice of 
the interaction energy Q but would be similar also for 
other choices of £ij that break the homogeneity of the 
interaction energies. Indeed, it is known that, at least 
in mean-field theory [l(3 | , the inclusion of a random part 



in the interaction energy gives rise to the appearance at 
low temperature of a spin-glass like phase. This phase 
then plays the role of the native one. It therefore seems 
quite possible that taking into account self-avoidance and 
for rather general choices of non-homogeneous interaction 
energies, one recovers the three phases found here. 

Upon lowering the temperature at fixed values of the 
interaction energies Sij there are thus two possible sce- 
nario's. Either one goes directly into the native regime, 
or one goes through an intermediate molten phase. We 
believe that the first order transition line approaches the 
line g = 1 when q ^ oo, although we could not prove 
this and simulations in this regime are difficult with the 
pivot algorithm. If this belief is correct it seems that the 
molten phase, where a description in terms of homoge- 
neous interactions is correct, can never be the stable one 
at very low temperatures. 

Our results can be used as a starting point for stud- 
ies which are of more interest from the point of view of 
molecular biophysics. Firstly, still within the context of 
the simplified model we are investigating the probabil- 
ity of occurence of pseudoknots. It is intuitively obvious 
that in the hairpin-phase the occurence of pseudoknots 
will be severely surpressed. But one would be interested 
to obtain a more quantitative insight into this issue. 

Another application for which we think that our model 
can be useful is an investigation of the elastic properties 
of RNA. These have been measured recently using mi- 
cromanipulation techniques 30]. The theoretical study 
of elastic properties of biomoleculcs is usually performed 
within simple continuum models such as the worm like 
chain (WLC) [sJl • This is certainly a very good approach 
when effects of self-avoidance can be neglected. For non- 
interacting models of polymers, it is known |32l | that as 
soon as an infinitesimal force is applied, the polymer be- 
comes stretched and in such a regime it can be expected 
that it can be described by the WLC or in terms of di- 
rected polymers. However, for homopolymers below the 
theta-point it has been established that they undergo a 
transition to a stretched phase only for forces greater 
then a critical force Fc > [sslls^ . Then, in the whole 
region where the forces are below this threshold, effects 
of self-avoidance are of importance. We expect that a 
similar scenario might hold within the low-temperature 
phases (BP and native) of RNA. 

Finally, the scenario which we have found here for the 
denaturation transition could be quite general and also 
be of relevance for proteins. Also in that case it is possi- 
ble that depending on the ratios of relevant interactions, 
the denaturation transition takes place immediately, or 
through an intermediate molten phase. 
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